Knowledge-based sentence semantic similarity: algebraical properties
نویسندگان
چکیده
Abstract Determining the extent to which two text snippets are semantically equivalent is a well-researched topic in areas of natural language processing, information retrieval and summarization. The sentence-to-sentence similarity scoring extensively used both generic query-based summarization documents as significance or indicator. Nevertheless, most these applications utilize concept semantic measure only tool, without paying importance inherent properties such tools that ultimately restrict scope technical soundness underlined applications. This paper aims contribute fill this gap. It investigates three popular WordNet hierarchical measures, namely path-length, Wu Palmer Leacock Chodorow, from algebraical intuitive properties, highlighting their limitations theoretical constraints. We have especially examined related range score, incremental monotonicity evolution, with respect hyponymy/hypernymy relationship well set interactive properties. Extension word sentence has also been investigated using pairwise canonical extension. Properties scrutinized. Next, overcome terms accounting for various Part-of-Speech categories, “All word-To-Noun conversion” makes use Categorial Variation Database (CatVar) put forward evaluated publicly available dataset comparison some state-of-the-art methods. finding demonstrates feasibility proposal opens up new opportunities processing tasks.
منابع مشابه
A Link Grammar and Semantic Corpus Based Sentence Similarity Measure
A novel sentence similarity measure that based on grammar and semantic corpus is presented. The well-known problem in the field of semantic processing, such as natural language processing, QA systems, expert systems, search engines, etc., is trying to evaluate the semantic similarity between sentences or articles. The major challenge is to evaluate the similarity of sentence-vs.-sentence since ...
متن کاملCorpus-based and Knowledge-based Measures of Text Semantic Similarity
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and knowledge-based measures of similarity. Previous work on this problem has focused mainly on either large documents (e.g. text classification, information retrieval) or individual words (e.g. synonymy tests). Given that a large fraction of the information available today, on the Web and elsewhere,...
متن کاملSentence Similarity based on Relevance
The issue about sentence similarity is essential to many areas of artificial intelligence. Although there are related studies on determining text similarity, fewer publications are about the similarity between short texts especially about the similarity between sentence pairs. In this paper, a novel method is proposed to estimate the sentence similarity with consideration of direct relevance an...
متن کاملA Sentence Semantic Similarity Calculating Method Based on Segmented Semantic Comparison
In order to calculate sentence semantic similarity more accurately, a sentence semantic similarity calculating method based on segmented semantic comparison was proposed. Sentences would be divided into the trunk and the other segments by some grammar rules, and each segment might be divided into several shorter segments. When calculating the sentence semantic similarity between two sentences, ...
متن کاملWord Semantic Similarity Calculation Based on Domain Knowledge and HowNet
Word semantic similarity is the foundation of semantic processing, and is a key issue in many applications. This paper argues that word semantic similarity should associate with domain knowledge, which traditional methods did not take into account. In order to adopt domain knowledge into semantic similarity measurement, this paper proposed a sensitive words sets approach. For this purpose, we a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Progress in Artificial Intelligence
سال: 2021
ISSN: ['2192-6352', '2192-6360']
DOI: https://doi.org/10.1007/s13748-021-00248-0